Identifiability of the proportion of null hypotheses in skew-mixture models for the p-value distribution
Authors
Abstract
In many multiple testing procedures, accurate modeling of the p-value distribution is a key issue. Mixture distributions have been shown to provide adequate models for p-value densities under the null and the alternative hypotheses. An important parameter of the mixture model that needs to be estimated is the proportion of true null hypotheses, which under the mixture formulation becomes the probability mass attached to the value associated with the null hypothesis. It is well known that in a general mixture model, especially when a scale parameter is present, the mixing distribution need not be identifiable. Nevertheless, in our mixture-model setting for p-values, we show that the weight attached to the null hypothesis is identifiable under two very different types of conditions. We consider several examples, including univariate and multivariate mixture models for transformed p-values. Finally, we formulate an abstract theorem for general mixtures and present other examples.
AMS 2000 subject classifications: Primary 62E10; secondary 62G99.
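A minimal sketch of why the null weight can fail to be identifiable when no conditions are imposed. It is a standard textbook-style illustration, not the construction used in the paper. It assumes a uniform null density on (0, 1) and a hypothetical Beta(a, 1) alternative density with a < 1. The names `mixture_density`, `beta_alt`, and `shifted_alt` are invented here for illustration. Because this alternative density is bounded below by a > 0, part of it can be reassigned to the uniform component, giving a second decomposition of the same p-value density with a larger null weight.

```python
def mixture_density(p, pi0, alt_density):
    """Two-component p-value density: uniform null with weight pi0 plus an alternative."""
    return pi0 * 1.0 + (1.0 - pi0) * alt_density(p)


def beta_alt(a):
    """Hypothetical alternative: Beta(a, 1) density a * p**(a - 1), with 0 < a < 1."""
    return lambda p: a * p ** (a - 1.0)


def shifted_alt(a):
    """Beta(a, 1) density with its uniform floor 'a' removed, renormalised to integrate to 1."""
    return lambda p: (a * p ** (a - 1.0) - a) / (1.0 - a)


# Assumed illustrative values, not taken from the paper.
pi0, a = 0.6, 0.5

# Move the uniform floor of the alternative into the null component.
pi0_alt = pi0 + (1.0 - pi0) * a

# Compare the two decompositions on a grid of p-values in (0, 1].
grid = [k / 100 for k in range(1, 101)]
max_gap = max(
    abs(mixture_density(p, pi0, beta_alt(a)) - mixture_density(p, pi0_alt, shifted_alt(a)))
    for p in grid
)
print(pi0, pi0_alt, max_gap)
```

Both parameterisations produce the same density up to floating-point error, so the data alone cannot distinguish the two null weights. Conditions of the kind the abstract describes are what rule out such alternative decompositions.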
Similar resources
The Family of Scale-Mixture of Skew-Normal Distributions and Its Application in Bayesian Nonlinear Regression Models
In previous studies on fitting non-linear regression models with a symmetric structure, normality is usually assumed in the analysis of the data. This choice may be inappropriate when the distribution of the residual terms is asymmetric. Recently, the family of scale-mixture of skew-normal distributions has become a main concern of many researchers. This family includes several skewed and heavy-tailed d...
A moment-based method for estimating the proportion of true null hypotheses and its application to microarray gene expression data.
Due to advances in experimental technologies, it is feasible to collect measurements for a large number of variables. When these variables are simultaneously screened by a statistical test, it is necessary to consider the adjustment for multiple hypothesis testing. The false discovery rate has been proposed and widely used to address this issue. A related problem is the estimation of the propor...
Determination of the number of components in finite mixture distribution with Skew-t-Normal components
One of the main goals in the study of mixture distributions is to determine the number of components. There are different methods for determining the number of components. The first is the Greedy-EM algorithm, which adds a new component to the model until the best number of components is reached. The second method is based on maximum entropy, and the third method is based on non...
Evaluation of prognostic factors affecting long and short term survival rates of Hodgkin's lymphoma patients using the cure fraction models
Background and Aim: This study aimed to analyze the factors affecting the time and experience of relapse in patients with Hodgkin's lymphoma, using cure fraction. Materials and Methods: This retrospective study included all patients diagnosed with Hodgkin's lymphoma at the Center for Oncology and Hematology in Shafa Hospital in Ahwaz City from 2002 to 2012. We used survival analysis and cure f...
Parameter Estimation in Spatial Generalized Linear Mixed Models with Skew Gaussian Random Effects using Laplace Approximation
Spatial generalized linear mixed models are commonly used for modelling non-Gaussian discrete spatial responses. We present an algorithm for parameter estimation in these models using a Laplace approximation of the likelihood function. In these models, the spatial correlation structure of the data is captured by random effects or latent variables. In most spatial analyses, it is assumed that rando...